This file contains descriptions of the contents of the datasets.


ABbySample: Contains matched audits and backchecks (ie, each audit has a backcheck as a separate observation). Data is at plant X audit year X seasonal visit level. 

ABbySample_reshaped: ABbySample, reshaped to the plant X audit year X seasonal visit X pollutant level.

ABbyParameter: Contains matched audits and backchecks. Data is at plant X audit year X seasonal visit X pollutant level. This file has all the same data as ABbySample, but is reshaped so the audits and backchecks for a particular parameter are in the same observation.

assignments: Contains information on assignments, as well as a number of time-invariant plant characteristics.

auditData: All audits received by GPCB, at the plant X audit year X seasonal visit level.

auditData_byParameter: All audits received by GPCB, at the plant X audit year X seasonal visit X pollutant level.

endline: Endline pollution measurements conducted by credible auditors, at the plant X pollutant year.

full_endline: Endline pollution measurements and fixed plant characteristics, at the plant level.

limits: Pollution limits for each pollutant (and dependent on whether pollution flows to Common Effluent Treatment Plant). Also contains units pollutants are measured in.

